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Abstract. We produce a collection of families of curves, whose 
point count statistics over ¥p becomes Gaussian for p fixed. In par- 
ticular, the average number of Fp-points on curves in these families 
tends to infinity. 



1. Introduction 

The purpose of this note is to exhibit a collection of families of 
smooth curves whose normalized limiting point count statistics, over a 
fixed finite field Fp, have a Gaussian distribution. Given a finite fam- 
ily J^i of smooth curves defined over Fp, let Mi := j^^^ceJ^t l^(^p)l 
be the average number of Fp-points on the curves C & J^i and let 
Vi := Scg.f, (l^(^p)l ~ be the variance of the fluctuations in 
these point counts. Here, and in what follows, C(Fp) will denote the set 
of Fp-points on C, and |C(Fp)| its cardinality. We can now formulate 
the main result of this note. 

Theorem 1. There exists a sequence of families {J^i}"^^ of smooth 
curves defined over Fp with the following properties: \J-'i\, Mi,Vi all 
tend to infinity, and, for all compact intervals I , 



I 



I =-1=1 e-^'/'dx + o{l), 



as t ^ oo. 



To obtain such a sequence we intersect projective surfaces Xj C P""*, 
chosen so that |Xj(Fp)| tends to infinity as i — )■ oo, with families of large 
degree hypersurfaces. More precisely, let Si{d) be the set of degree d 
homogeneous polynomials in rij-l-l variables with coefficients in Fp. For 
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/ G Si{d), let Hf C P"' be the hypersurface defined by the zero set of 
/. Intersecting Xj with Hf we generically obtain a (possibly singular) 
curve Ci{f) := Xi fl Hf. Letting 

Si{d):= {f e Si{d) -.XiDHf is smooth}, 

we obtain a family of smooth curves 

T,id) := {Qif) : / G S^id)}. 

Our main technical result (in essence a slightly more explicit version 
of Poonen's [16:, Theorem 1.2]) asserts that the distribution of point 
counts for curves in this family, for d large, is binomial — we can think 
of it as the number of successes when an unfair coin is tossed |Xj(Fp)| 
times. 

Proposition 2. Let Xi C P"' be a smooth projective surface defined 
over¥p, and let Fi{d) he defined as above. Then, as d oo, 

\{Cem)--\C{¥,)\ = s}\ 
\Hd)\ 

/iAm)iu^yA^y---'. 

\ s J VP +P+IJ \ + P+1J 
uniformly for < s < |Xj(Fp)|. In particular, the average point count 
of a curve C E J^i{d) equals |Xj(Fp)| ■ ^i^^-^ ■ (1 + o(l)) as d ^ oo. 

Given Proposition [2], we can easily obtain Theorem [1] by the central 
limit theorem type argument for coin fiip models (cf. Section [2]) pro- 
vided we can find a sequence of surfaces {Xj}j>i such that |Xj(Fp)| — > 
oo. Any such sequence suffices, but for concreteness we will use Ihara's 
construction [10] (independently discovered by Tsfasman, Vladu^, and 
Zink [18j), of families of curves with many points over Fp2, and a (Weil) 
restriction of scalars argument will then produce a projectively embed- 
ded Fp-surface having many points — see Section 12.21 for more details. 

Remark: Since the relation |C(Fp)| = p + 1 - Tr(Frob \Hl{CMi)) 
holds between the number of Fp-points on a curve C and the trace 
of Frobenius (see e.g [TTl Ch. 11]), we have in fact exhibited fami- 
lies of smooth curves for which the average of the trace of Frobenius 
exhibits a strong negative bias. (Note that the average trace of Frobe- 
nius should be zero according to random matrix theory predictions; cf. 
Section [LTD) 
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1.1. Background and discussion. 

1.1.1. Gaussian point count statistics in other models. The number of 
Fp-points on a smooth curve C of genus g, defined over a finite field 
Fp, can be written as 

|C(Fp)|=p + l-^«, 

i=l 

where G Q are the eigenvalues of the Frobenious action on a certain 
cohomology group. By Weil's proof of the Riemann hypothesis for 
curves, = ^/p for 1 < i < 2g. Hence |C(Fp)| = p + l—p^^"^ ■Ti{Uc), 
where Uc € U{2g) is a unitary 2g x 2g matrix, unique up to conjugacy. 
If we let C range over a family J^(Fp) of smooth curves defined over Fp 
(e.g., the family of hyperelliptic curves J-" = {Ct}t&p:f{t)y^o where Ct = 
{{x,y) : y'^ = f{x){x — t)} and / G ¥p[x] is a square free polynomial 
of degree 2g) it is natural to study the distribution of the fiuctations 
of the points counts by looking at the normalized fluctuations, i.e., the 
quantity (|C(Fp)| -p- 1)/^ = - Tt{Uc). 

By Deligne's equidistribution theorem, the distribution of the conju- 
gacy classes of the Uc^s, as p — )■ oo, are given by random matrix theory 
when the family has "large monodromy". For example, Katz and Sar- 
nak [13] has shown that for the family of hyperelliptics given above, 
the limiting distribution on the t/c-conjugacy classes, as p — )• oo, is 
given by the Haar measure on USp{2g), the group of unitary sym- 
plectic 2g x 2g matrices. On the other hand, in the limit g oo, 
Diaconis and Shahshahani has shown [0] that the limiting distribu- 
tion of {TY{U)}u^usp(2g), as (? — )■ oo, is a Gaussian with mean zero and 
variance on^. Thus, if a collection of families of curves have large mon- 
odromies, then the (normalized) point count fluctuations are Gaussian 
in the double limit limg^oo linip^oo- 

For p fixed and — )■ oo it is less clear what to expect regarding the 
distribution of the f/c-conjugacy classes and their traces. For instance, 
random matrix theory (RMT) is clearly not an appropriate model since 
the inequality < |C(Fp)| = p + 1 — ^/p ■ Tr(f/c) does not hold for 
all Uc G USp{2g) for g sufficiently larg^. Similarly, if the curves 
in the families can be embedded into P"'(Fp) for n fixed, the bounds 

^It is rather remarkable that even though Tr([/) is a sum of 2g complex numbers 
on the unit circle, the variance does not scale as y/g, but is in fact identically equal 
to one. 

^However, it is worth noting that certain statistics of the eigenvalues are con- 
sistent with RMT, e.g., the fluctuations of the number of eigenvalues in random 
short intervals (cf. [7]). Moreover, in [T7] Rudnick found that the one-level density, 
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< \Ci¥p)\ = p + 1 - ^ ■ Tt{Uc) < |P"(Fp)| rules out a Gaussian. In 
fact the normalized distribution cannot even have continuous support 
since |C/(Fp)| is integer valued and the variance is bounded. 

To get some insight into the large genus limit while keeping the 
ground field fixed, Kurlberg and Rudnick studied (cf. [13]) families 
of hyperelliptic curves of the form Cf : y"^ = f{x), where / ranges 
over monic square free polynomials of degree d. They found that the 
fiuctuations in this family, as — )• oo, has the same distribution as 
Xi, where Xi, . . . ,Xp are independent random variables taking the 
values 0,-1,1 with probabilities l/(p + 1), 1/2(1 + p"^), 1/2(1 + p"^), 
respectively. Moreover, the moments for the normalized point count 
distribution were shown to be Gaussian as long as both p, d — )■ oo, i.e., 
any joint limit rather than letting p tend to infinity first (as described 
above.) 

In |3], Bucur, David, Feigon and Lalm generalized this to cyclic l- 
fold covers of P^; here the distribution of the fiuctuations are given 
by 2 Re(^f^j^ Xi) where independent random variables 

taking the values 0, exp(27ri//), . . . , exp(27rz(/ — 1)//) with probabilities 
2/(p+2),p/(3(p+2)), . . . ,p/(3(p+2)), respectively. Further, in [2J, they 
studied the family of smooth curves in the projective plane cut out by 
degree d homogenous polynomials, and found that the distribution of 
the point count statistics for this family, as d — )■ oo, is the same as that 
of Y7i=i'^^ where xi, . . . , Xp2_,_p+i are independent random variables 
taking the values 0, 1 with probabilities p^/ (p^+p+1), (p+l)/(p^+p+l), 
respectively. In both [31 [2], Gaussian moments were obtained when p 
tends to infinity; in the first case along any limit p,d — )■ oo, whereas 
the assumption d 3> p^"*"" is needed in the second case. 

We finally note that in [TB] (unpublished), Larsen obtained Gaussian 
moments for a smooth family of hyperelliptic curves of the form = 
nr=i(^ ~ '^j)' where ai, . . . , a„ G F^, ranges over distinct elements. 

1.1.2. Remarks on vanishing probabilities. Given a point P G Xj(Fp), 
the probability of a polynomial / G Si{d) vanishing at P is 1/p, so 
one might expect that the average number of Fp-points on Xj fl Hj 
should equal \Xi(Wp)\/p. However, as we have seen, this prediction is 
not quite correct — by conditioning on / so that Xj fl Hf is smooth, 
the probability of / vanishing at P turns out to be slightly smaller 
than expected, and is given by ^2^^^ (rather than by 1/p). A similar 
phenomenon was already observed in [2] for smooth plane curves: the 
probability of point P G P^(Fp) belonging to a smooth curve given by 



a local statistics, was in agreement with RMT (even though averages of traces of 
small powers was not.) 
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the zero set of a random homogenous polynomial (of large degree) is 
(p + + p + 1), rather than 1/p. 

1.1.3. Remarks on families of curves with many points. In order to ob- 
tain Gaussian (normalized) point count statistics, it is essential that 
there is no a priori upper bound on the number of Fp-points on the 
curves; in particular, families of plane curves, families with bounded 
genus, or families with bounded gonality cannot be used. In fact, some- 
thing even stronger is needed: since point counts are integer valued, 
the variance must grow to infinity for the normalized distribution to 
have continuous support. Thus, since the limiting distribution must be 
symmetric around the mean, together with the fact that the number 
of points on a curve is non-negative, the average number of Fp-points 
of a curve in the family also must tend to infinity. 

A natural candidate for families of curves with unbounded point 
counts over Fp is Mg(Fp), the set of isomorphism classes of genus g 
curves. However, if these point counts can be modeled by random 
matrix theory (e.g., as in the case of g fixed and p — ?■ oo as shown in 
|13]). the average number of points would be p + 1 since the average 
trace of Frobenius equals zero by random matrix theory predictions; 
hence it is unclear whether using Mg(Fp), (7 — oo as a collection of 
families would work. (Also see [H [12] for explicit results on average 
point counts, as p — )■ 00, for curves in various families.) 

Another possibility to avoid a priori upper bounds might be to con- 
sider smooth curves given by intersecting n — 1 generic hypersurfaces 
in P". However, the average number of points on curves in this family 
turns out to be bounded; Bucur and Kedlaya recently [4j showed that it 
is slightly less than p + 1. In particular, the average trace of Frobenius 
for curves in this family is not equal to zero. 

1.1.4. Acknowledgments: The authors would like to thank Alina Bu- 
cur, Chantal David, Nicholas Katz, and Zeev Rudnick for helpful dis- 
cussions. 



2.1. Normal distribution from coin flip model. We first recall 
some facts about the binomial distribution. For i > 1, let -Bj be the 
binomial random variable counting the successful tosses among 



2. Proof of Theorem [T] 



(2) 
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tosses of an unfair coin, whose probability of success equals 

(3) r - 

Let Mj' and V/ be the expected value and the variance of the number 
Bi of successful tosses respectively. That is, M- := ¥.[Bi] = rii ■ r and 
VI := Var{Bi) = rii ■ r(l — r). It is a classical result in probability (see 
e.g. [8], chapter 7) that -Bj, suitably normalized, tends to the standard 
Gaussian, provided that — )■ oo (note that the probability of a success 

stays constant). Namely, — t- A/ (0, 1) in the sense that for any 
compact interval / CM, 

(4) Prob I ^ ^) ^ vb / 



We are now in position to prove Theorem [H 

Proof of Theorem [I] assuming Proposition 0. Recall the definitions of 
Ui and r in Equations (j2]) and ([3]) respectively. For the coin flip model 
the probability of precisely k successes is given by 



Prob(5i = k)= l^^jr^il-r 

whereas, by Proposition |2l Prob (|C(Fp)| = k) for our family of curves 
equals 

'^^^/•(1-rr-'- (1 + 0(1)) 

with constant involved in the 'o'-notation being uniform in k. In par- 
ticular, we find that 



= ^ E (ic'(^p)i - M^d" = v: ■ (1 + 0(1)). 

Further, 



P„b(MM^,A. j2 (:)^'(i-0"'-Ni+o(i)) 

keAU+I-Vin[0,ni] 

SO by comparing with the corresponding sum in the coin flip model and 
using the classical (jl]), together with 



' M r' (1 - .•)"-' = 1 

fc=0 ^ ^ 

to control the error term, we obtain the statement of Theorem [TJ 
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□ 

2.2. Surfaces with many points. A crucial assumption in Theo- 
rem [1] is the existence of a sequence of surfaces {Xi} whose point counts 
over ¥p tends to infinity; here we give a concrete example of such a se- 
quence. We begin by recalling the construction of modular curves with 
many Fp2 -points given in [T8] . 

Theorem 3. If I is a prime greater than p, there exists a smooth com- 
plete curv^ Xo{l), defined over ¥p, having at least {p — + 1)/12 
points over ¥p2 . 

If C is curve of genus g > I, and C is not hyperelliptic, there exists 
a canonical embedding : C — )■ P^~^, whose image is a curve of degree 
2^-2 inside P^'^ (cf. [9, Ch. 4.5]). Now, the genus of Xo{l) equals 
gi = [1/12], and since a hyperellipic curve has at most 2{p + 1) points, 
Xo(/) clearly cannot be hyperelliptic for / sufficiently large. Thus Xq{1) 
can be canonically embedded into P^'~^ for / large. 

Letting /j denote the i-th prime we now define a sequence of surfaces 
{Xi} by letting X^ be the Fp-surface obtained by Weil restriction of 
scalars, from Fp2 to Fp, of Xo(/i). As remarked above, Xo(/i)/Fp2 is 

canonically embedded into Pp' ^, and it is known (e.g., see Lemma 7.5 

2 

in Ch. L7.2 of [5]) that PJJ ^ can be projectively embedded in P^""''"'^^ ^, 
hence each X^ is a projective surface with |Xj(Fp)| = |Xo(/i)(Fp2)| > 
{p — + 1)/12 for i sufficiently large. 

3. Proof of Proposition [2] 

We begin with some notation: given a variety X defined over F^, let 
Cx{s) denote the zeta function attached to X, i.e., 

Cxis):= n (I-P^'^^^^^l^^ 

P closed 

For Ci{f) = Xj n Hf to be smooth, Hj must intersect Xi transver- 
sally at all points P G Xi(F^) such that f{P) = 0. Le., if f{P) = 
for P G X(Fpfe) and we write f\xi in local coordinates x,y (say with 
P corresponding to x = y = 0), we must have f\xi = ax + by + 
(higher order terms), where a,b E Wpk and (a, 6) 7^ (0,0). In other 
words, f\xi must not have quadratic order of vanishing at any point 

^Xq{1) is the projective smooth model of the modular cm^ve Yq{1), which 
parametrizes pairs {E, C) where E is an elliptic curve and C C is a cyclic 
subgroup of order I. 
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P G Xi(¥pk) (for any k), or equivalently, the image of / in Ox,p/'(nj^ p 
must be nonzero for all P G Xi{¥pk) and all k. 

Since X is a smooth surface, \Ox,p/vnx^p\ = p^^ for all P G Xj(Fpfe), 
so the "probability of smoothness at P" equals 1 —p~^^; if these condi- 
tions are sufficently independent as P varies, the probability of Ci{f) 
being smooth should be given by 1/^x^(3). Using a sieving argument, 
Poonen showed that this heuristic indeed gives the correct answer when 
d oo and / ranges over elements in Si{d). Further, his "Finite field 
Bertini with Taylor coefficients" Theorem 1.2] allows for controlling 
the behaviour of / in the neighborhood of a finite number of points, best 
formulated in terms of schemes. We shall need the following slightly 
more explicit version: 

Theorem 4. Let X be a quasiprojective subscheme of P" over ¥q, 
and let S{d) be the set of degree d homogenuous polynomials in n + 1 
variables. Let Z be a finite subscheme o/P", and assume that U : = 
X — [Z r\ X) is smooth of dimension 2. Fix a subset T C H^{Z, Oz)- 
Given f G S{d), let f\z be the element of H^{Z,Oz) that on each 
connected component Z^ equals the restriction of xj'^f to Z^, where 
j = j{k) is the smallest j G 0,1, ... ,n such that the coordinate Xj is 
invertible on Z^ . Then as d —t- oo 



|{/ G S{d) : Hf nU is a smooth curve, and f\z G T}\ 

\S{d)\ \T\ 



Proof. We will closely follow the closed point sieve of [161 Section 2]. 
The case T = is trivial, so we may assume that |T| > 1. Let Ur be 
the set of closed points of U of degree < r, and define U>r similarly. 
By the proof of [16, Lemma 2.2], for r fixed and d sufficiently large. 



|{/ G S{d) -.RfHU is smooth at all P e Ur and f\z G T}| 



\S{d)\-\T\ 



n (^-p 



3deg{P)^ 



which in turn equals 

1^(^)1 -m {i+o{p-'n) 

\H%Z,Oz)\ (u{S) 

Letting r grow with c? in a suitable fashion, we obtained the claimed 
main term; to conclude the proof it is enough to bound the number 
/ G S{d) for which smoothness of Hf nU is violated at some point P 
of degree larger than r. 
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Medium degree points: By the proof of flGl Lemma 2.4], 

|{/ G S{d) : HfOU is not smooth at some P G f/>r H U^d/s}] <c/ \Sid)\-p'''. 

Large degree points: By the proof of 1,16, Lemma 2.6] (in particular, 
see Claim 1 and Claim 2), there exists r e Z"*", only depending on U, 
such that 

|{/ G S{d) : Hf n f/ is not smooth at some P G t/>d/3}| 

= \S{d)\ ■ 0{dp-^'^-^^/P + (fp-^H[d/p]+hdm^ 
which, for d sufficiently large, is 

Now, taking r = [d/A], we find that the total contribution from medium 
and large degree primes is ^ \S{d)\p~'^^^ and the result follows by 

taking d sufficiently large so that = o (^ \ho{z Oz)\(u(3) ) ' 

□ 

Remark. In particular, taking Z to be empty, we obtain 
(5) \S{d) \ = \{f e S{d) : HfHX zs a smooth curve}\ 

which can be interpreted as saying that the probability of Hf flX being 
smooth is 1/Cx(3) + o(l) for X a surface. 

3.1. Applications to point counting. To apply Theorem HJ we de- 
fine a finite subscheme Z C X as follows: let X{¥p) = {Pi, . . . ,Pt}, 
and for 1 < i < t let tTij be the ideal sheaf of Pi on X. Let Zi be the 
closed subscheme of X corresponding to the ideal sheaf C Ox, and 
define Z := U'^^Zi. Note that H^{Z, Oz) ^ Ul=i H^{Zi, Oz,) and also 
that H'^{Zi,Ozi) — Ox,Pi/vcv\ p_] in particular 

\H\Z,, Oz;)\ = \Ox,pJmx,P^ ■ |mx,P./m^,pJ =p.p'=p\ 

Given a collection of s points W C X(Fp), we define a subset T{W) C 
H^{Z,Oz) using the above isomorphisms: let T{W) := ni=i where 
T, C H\Z,, Oz:) - Ox,pJrn\p^ is given by 



T,; := 



{(f>emx,pJm\p^:ct)^Q} if G 1^ 

{(t>eOx,pJm\p^:ct>^Q modmx,pJ if P. G X(Fp) - 



Note that \Ti\ = p"^ - I ii Pi e W , and \Ti\ = p^ - p"^ if P^ ^ W, 
hence \T{W)\ = {p^ - l)"(p3 - p^f-' . We also note that if / G S{d) 
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and f\z e T, then {Hf fl X)(Fp) = W, and if/ n X is smooth at all 

P^eW. 

Letting W range over all subsets of X(¥p) of cardinality s, define a 
subset T C H'^iZ, Oz) by 

T := yJwcix(¥p):\w\=sT{W). 

Then |T| = (*) ■ (p^ - iy{p^ - p^y~\ and, by construction, f\z G T 
is equivalent to X fl Hj being smooth at all points in X(Fp) and that 

\xnHf \ = s. 

By Theorem H] and ([5]) , together with 

Cxis) = Cu{s) n {i-p~T' = Cuis){i-p-T' 

we find that for c/ — )■ oo, 

|{/ e 5(ci) : |X(Fp) nHf \ = s and X n Hf is smooth }| 



|{/ G : X n ii/ is smooth }| 
\S{d)\/Cu{S)-\T\/\H^{Z,0{Z))\ 



\S{d)\/(x{S) 



1 + 0(1)) 



^ |T|/|/iO(Z,0(Z))| 

which, since |r| = (*) • {p^ - lY{p^ -p^-' and \H%Z,0{Z))\ = p^\ 
equals 

Q(p2-l)'(p!'-p2)'- 



■(1 + 0(1)) 



t\ f p +1 \^ f P 



J \p'^ + p+1 J \p^ + p + 1 
and hence concluding the proof of Proposition [2j 
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